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Abstract. The design of efficient large scale networks is a common prob- 
lem in Operations Research. Some of those, such as electric power grids, 
transportation systems, communication networks, and others, must main- 
tain their efficiency even after several failures, or malicious attacks. In 
this work, we outline a procedure that modifies any given network to 
enhance its robustness, defined as the size of its largest connected com- 
ponent, whilst keeping a high efficiency, which is described in terms of 
the shortest paths among nodes. We also show that this generated set of 
networks is very similar to networks optimized for robustness in several 
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' aspects such as high assortativity and an onion-like structure. 
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1 Introduction 

In recent years, network studies have attracted a lot of research. As a result, 
it has been shown that several artificial (e.g. the Internet, electric-grids, etc.) 
and natural systems (e.g. chemical reaction networks, food networks, gene reg- 
ulatory networks, etc.) present characteristics that allows one to classify them 
as Complex Networks. Their structure and the dynamics of phenomena taking 
place on them have been intensively studied, thanks to the availability of large 
data sets [5J. An important aspect of a network, especially in fields where net- 
work functionality is essential such as transportation, production, power, and 
biological networks, among others, is the capability to withstand failures and 
fluctuations in the functionality of its nodes and links. The design of networked 
infrastructures with these capabilities can be thought of as an optimal design 
task and their resilience in the face of targeted failures, such as terrorist attacks 
is still an open problem. An early important work in this field is Albert et al. [T] 
where the authors showed by numerical simulations that scale-free networks of 
the Barabasi- Albert type [3], while they are robust against random removal of 
nodes, are much more vulnerable to the removal of nodes according to node's 
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degree. In other words, in a scale-free graph if the nodes are removed in decreas- 
ing order of degree, starting with the most connected ones, then the network 
falls apart very quickly because those highly connected nodes are the ones that 
hold the network together. 

In the work of Schneider et al. [8] , a procedure is described that successfully 
modifies scale-free networks so that the largest connected component still has 
a considerable size after several attacks targeted at the most connected nodes. 
This feature guarantees that there is at least one path connecting a large number 
of pairs of nodes after the removal and is considered an appropriate definition of 
robustness. A natural question that follows is the maintenance of network effi- 
ciency after attacks, i.e. a network is efficient in this sense if "good paths" among 
nodes do not cease to exist after several targeted failures. Using a consolidated 
definition of efficiency for unweighted networks, we propose an optimization pro- 
cedure that modifies existing networks in order to improve their efficiency under 
targeted attacks. 

This paper is organized as follows. In Section Methodology, we present our 
measures of robustness, efficiency, and a method to optimize a specific char- 
acteristic of a network. Then, we show in Section Results several comparisons 
of optimized and unoptimized networks. We highlight the major points of our 
contribution in Section Discussion. 

2 Methodology 

The proposed methodology is an extension of the work of Schneider et. al [8], 
who used a hill-climbing procedure to optimize a network robustness against 
targeted attacks. In this work, we modify the approach by adding a simulated 
annealing strategy [4] to avoid the search getting trapped in local maxima. This 
procedure is applied to the following objectives: Robustness, Efficiency, and a 
combined measure of both. We describe first the cost functions that we chose, 
followed by the optimization procedure. 

2.1 Robustness 

The definition of network robustness might vary according to a specific applica- 
tion. In this work, we call an attack the removal of a node of the network, and 
the robustness we measure by the size of the largest connected component of the 
network after this removal, as proposed by Schneider et al. [5]. To quantify it, we 
proceed with a series of targeted attacks (sequential removal of nodes starting 
from the highest degree after each interaction) and subsequently measure the 
robustness after each node removal. Hence, we defined the robustness R of a 
network as: 



Q=i 

where N is the network size measured as the number of nodes and s(Q) is the 
fraction of nodes in the largest connected component after Q = qN removals. 
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2.2 Efficiency 

One can think of network efficiency as a low cost of communication among its 
members. In this light, we relate efficiency with the shortest paths between all 
pairs of nodes, thus following Latora and Marchiori [5] who defined the network 
efficiency E as: 



where Uj stands for the shortest path length between nodes i and j. If i and 
j belong to separate connected components of the network, we set Uj — > oo to 
guarantee a consistent behavior of the cost function. 

2.3 Integral Efficiency 

Keeping in mind that we would like to keep the efficiency of networks after 
attacks, it is straightforward to modify the definition of R to account for this. 
Hence, we define the Integral Efficiency IntE as 



where E(Q) stands for the efficiency of the network after the removal of Q = qN 
nodes. In this case, the value of E(0) is the cost function E defined in Section [2721 
Chosing this quantity instead of E, which does not consider nodes removal in its 
definition, we try to avoid that the shortest paths among nodes will not increase 
after targeted attacks. 

2.4 Optimization procedure 

In their work, Schneider et al. [8] propose a simple hill-climbing search to modify 
a network topology in order to optimize the robustness R whilst keeping the 
degree of each node fixed. This restriction in often present in the modification of 
artificial systems, such as electric grids where constructing a receiver for a new 
power line in a station might be impractical. Hence, only swaps between lines 
(edges in the graph) are possible. A theoretical consequence of this restriction is 
that the underlying degree distribution of the network remains unchanged after 
these swaps. Clearly, if we had no constraints on the degree distribution, we could 
design the topology starting from scratch with the robustness and efficiency as 
objectives in mind, obtaining different optimal topologies. However, in this paper 
we shall follow Ref. 0. 

Next, we present an improved version of the optimization approach using 
simulated annealing [4] and we describe it for any measure M that changes after 
link modification: 
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1. Initial State. Let G(N,E) be a network with \N\ vertices and \E\ edges. 

2. Edge swap. Choose two pairs of edges and (k,l) S E randomly and 
create the network G* by deleting the edges (i,j) and (k, I), and adding the 
edges (i, I) and (k,j). 

3. Acceptance probability. Calculate the transition probability p of the sys- 
tem as: 

if M(G*) < M(G) 
[l if M(G*) > M(G) 

4. Comparison. Draw a random number r from U (0, 1). If r < p make G = G* , 
otherwise discard G* . Return to Step 2. 

This approach allows a network G* with M(G*) < M{G) to be chosen with 
finite probability. By doing this, global minima could be reached and inferior 
local minima could be avoided. Furthermore, by decreasing the value of T ac- 
cording to the amount of edge swaps executed, it is possible to decrease the 
acceptance ratio of worst networks when an optimum point is close. 

In this work, we decrease the temperature as function of the number r of 
edge swaps, by following the equation: T(r) = 0.0001 x 0.8 T . Variations to this 
function have shown little effect on the results. The search is stopped when a 
predefined amount of edge swaps is reached. 



3 Results 

The procedure outlined in Section [2^41 is applied to the cost functions: R (Ro- 
bustness as described in Section l2~Tj) . E (Efficiency as described in Section |2~2"|) . 
and IntE (Integral Efficiency as described in Section 12. 3[) , starting from the 
same set of randomly generated Barabasi-Albert networks. Hence, we created 
three sets of networks: Robustness set, Efficiency set, and Integral Efficiency set. 
As a control, we compare to the original set of BA networks which we call the 
Unoptimized set. 

The Unoptimized set is composed of 100 networks of n = 1000 nodes and 
average degree (As) = 5.95. In each of the following plots, the curves represent 
the optimized sets and the Unoptimized set, for comparison. Each point is the 
average value of the optimization procedure using 10.000 edge swaps for every 
network. Data with the final results of the optimization are presented in Table [TJ 

The size of the networks was chosen based on a trade-off between the ap- 
pearance of topological features such as the scale-free phenomenon, only present 
in large networks, and computational cost, as the IntE cost function requires 
0(n 3 ) operations to be calculated. The amount of edge swaps, 10.000, was chosen 
so that for each optimized set its cost function is already statistically different 
from the Unoptimized set. It is possible to see that this goal was achieved by 
comparing the values in bold for columns (E) , (R) , and (IntE) in Table [TJ 

To analyze the robustness of each set, a plot of s(Q) versus Q is shown 
in Fig. [TJ In this plot, the area below each curve represents the cost function 
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Table 1. Average values, standard deviations in subscripts. Each set comprises 100 
networks with n = 1000 nodes, (k) = average degree, (k 2 ) = average squared degree, 
(cc) = average of clustering coefficient, (r) = average assortativity coefficient, (E) = 
average efficiency, (R) = average robustness, (IntE) = average integral efficiency. 



Network Set (k) {cc} (r) (E) (R) (IntE) 

Unoptimized 5.95 0.02 42 . 00 33 - 0.085o.ois 0.1486 .ooi2 0.1837 .oo 5 3 0.03 08 .oon 
OptE 5.95 0.0053 .ooi4 -0.076o.on 0.1539 .ooi5 0.1826 .oos6 0.0310 .ooi2 

OptR 5.95 O.O2OO0.0027 0.038 . 24 0.1459 .ooi3 0.2266 .oo55 0.0372 .ooi2 

OptlntE 5.95 0.0195 .oo2 9 0.055 .o2 6 0.1456 .ooi3 0.2268 .oos2 0.0391 .ooi2 



R for each set. As expected, the Robustness set shows a bigger area (23% of 
increase), keeping a considerable size on the largest cluster of connected nodes 
after several attacks. Indeed, Schneider et al. [8] obtained an improvement of 
almost 75% for this cost function, but by using a much more exhaustive approach: 
their search stops after 10.000 edge-swaps without increase in R. Therefore, our 
results show that it is possible to increase already the network robustness using 
less computational effort. Moreover, it is interesting to note also that the curve 
for the Integral Efficiency set has only a slightly smaller area than R's one. 
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Fig. 1. Plot of the largest component size after the removal of Q nodes. The area bellow 
each curve is the cost function R. Each curve represent sets optimized for different cost 
functions. 
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In Fig. [21 the cost function IntE is analyzed through the plot of E(Q) versus 
Q, showing that, as expected, the Integral Efficiency set has the better perfor- 
mance, i.e. the area under the corresponding curve is bigger. Interestingly, the 
curve referring to the set of networks obtained by optimizing for E alone shows 
that they have about the same performance as the unoptimized ones for this 
cost function (data on Column (IntE) of Table [IJ . 



0.14 



0.12-5 
0.1- r 



0.06- 



S On 
0.04- O <>□ 



5^0 

2 

0.02- g £ 



9 °0< 



5 i 

0.08- 3 O Unoptmized set 

O °§ * Robustness set 



i 5 n V Efficiency set 

§Sn □ Integral Efficiency set 



5o 



100 200 300' ' • 400' ' " ' ' SO^^^^m 

Q 



Fig. 2. Plot of the network efficiency E(Q) after the removal of Q nodes. The area 
bellow each curve is the cost function IntE. Each curve represent sets optimized for 
different cost functions. 



Another interesting aspect of the work of Schneider et al j8] is the topology 
obtained by this optimization: a so-called onion-like structure. In this topology, 
each layer is composed of nodes connected with nodes of the same degree, with 
few connections between layers. A direct procedure to generate this topology 
can be found in the work of Wu et al [5] . 

To investigate the presence of an onion-like structure on our optimized sets, 
three quantities were analyzed. In Fig.|U we show the k-core decomposition [5] for 
several k, showing that the Robustness and Integral Efficiency sets have several 
k-core's or layers, thus confirming a hierarchical structure of the network. The 
Efficiency set does not present this clear hierarchy, but has more layers than the 
Unoptimized set. 

We also measure the robustness for each layer of a network. To do so, we ana- 
lyze the subgraph of each network composed of Nk nodes with degree smaller of 
equal to k. In this subgraph, Sk represents the size of its largest cluster. In Fig. El 
we plot Sk/Nk for several values of k. This plot shows that the Robustness and 
the Integral Efficiency sets present practically the same increase in robustness 
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with respect to the Unoptimized set. In contrast, the Efficiency set does not show 
any improvement with respect to the original scale-free unoptimized networks. 

Finally, in Fig. [5]we show that the Integral Efficiency set and the Robustness 
set of networks have the greater assortativity through the plot of Newman's r 
coefficient [7]; the Efficiency set is as disassortative as the Unoptimized set. 

All this evidence suggests an onion-like structure for the Integral Efficiency 
set. To provide a visualization of the network structure, some examples of each 
set were drawn with the k-core decomposition in Fig. |6l 
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Fig. 3. Relative size of the largest component in networks composed of nodes of degree 
less than k. 



4 Discussion 



In this paper, by describing a process that optimizes a specific characteristic 
in any type of network, we are able to create three sets of BA networks with 
distinguishable features. 

Firstly, our results show that the Integral Efficiency set substantially im- 
proved efficiency after attacks, compared to the Robustness, Efficiency, and Un- 
optimized sets. Moreover, this set also sustains a large connected cluster after 
attacks. Therefore, this cost-function could be used to generate highly robust 
and efficient networks. 

Another important result of our work is that networks optimized for IntE 
also present an onion-like structure. This result suggests that this structure is 
generically the optimal scale-free net independently of the chosen cost function. 
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Fig. 4. K-core decomposition for several values of k. It can be seen that the same 
network optimized for IntE presents more layers than the network resulted after the 
optimization for R. 
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Fig. 5. Box-and- whiskers plot of the degree assortativity through Newman's r coeffi- 
cient. Thick lines depict the median value; lower and higher hinges gives the 0.25 and 
0.75 quantiles, respectively; the whiskers extend to 1.5 times this inter-quantile range. 
Values outside this range are considered outliers and appear as circle dots in the plot. 



IX 




Fig. 6. Examples ol networks belonging to each set. The figures were drawn using the 
k-core decomposition. 
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It also helps the design of networks from scratch, as it is possible to construct 
scale-free networks which present this structure. 

It is also interesting to note that the Integral Efficiency set mantains several 
similarities with the Robustness set, such as: high assortativity, size of the largest 
cluster after attacks, efficiency after attacks, size of the largest cluster for each 
degree layer, and a hierarchical structure regarding the k-core decomposition. In 
fact, the Integral Efficiency set has a slightly better performance on assortativity 
and efficiency after attacks, while the Robustness set has a better performance 
on the others. 

In the future, we intend to focus on the structures of the three generated sets. 
The Efficiency set does not present an onion-like structure, remaining unclear 
if this optimization could lead to a different structure. The Integral Efficiency 
set might have a hidden feature that differentiates it from the Robustness set. 
By finding a typical structure of optimized networks, new networks could be 
designed from scratch with a desired feature. Also, we would like to investigate 
other cost functions that might lead to onion-like structures, and the case of 
weighted networks, as they are closer to real applications. 
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